Overview
Brought to you by YData
Dataset statistics
| Number of variables | 32 |
|---|---|
| Number of observations | 37609 |
| Missing cells | 147608 |
| Missing cells (%) | 12.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 59.0 MiB |
| Average record size in memory | 1.6 KiB |
Variable types
| Numeric | 1 |
|---|---|
| Text | 21 |
| Categorical | 6 |
| Unsupported | 2 |
| Boolean | 2 |
naics has constant value "0" | Constant |
naics_title has constant value "cross-industry" | Constant |
i_group has constant value "cross-industry" | Constant |
own_code has constant value "1235" | Constant |
annual has constant value "True" | Constant |
hourly has constant value "True" | Constant |
area is highly overall correlated with area_type | High correlation |
area_type is highly overall correlated with area | High correlation |
area_type is highly imbalanced (82.2%) | Imbalance |
o_group is highly imbalanced (86.3%) | Imbalance |
pct_total has 37609 (100.0%) missing values | Missing |
pct_rpt has 37609 (100.0%) missing values | Missing |
annual has 34943 (92.9%) missing values | Missing |
hourly has 37447 (99.6%) missing values | Missing |
pct_total is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
pct_rpt is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
| Analysis started | 2025-11-11 02:56:00.427687 |
|---|---|
| Analysis finished | 2025-11-11 02:56:02.757043 |
| Duration | 2.33 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
area
Real number (ℝ)
High correlation
| Distinct | 54 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.176633 |
| Minimum | 1 |
|---|---|
| Maximum | 78 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 293.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 17 |
| median | 30 |
| Q3 | 44 |
| 95-th percentile | 55 |
| Maximum | 78 |
| Range | 77 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 16.823968 |
|---|---|
| Coefficient of variation (CV) | 0.55751639 |
| Kurtosis | -0.5641297 |
| Mean | 30.176633 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.22257039 |
| Sum | 1134913 |
| Variance | 283.04588 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 6 | 829 | 2.2% |
| 48 | 827 | 2.2% |
| 42 | 818 | 2.2% |
| 36 | 817 | 2.2% |
| 12 | 814 | 2.2% |
| 39 | 810 | 2.2% |
| 26 | 806 | 2.1% |
| 37 | 795 | 2.1% |
| 53 | 794 | 2.1% |
| 51 | 792 | 2.1% |
| Other values (44) | 29507 |
| Value | Count | Frequency (%) |
| 1 | 736 | |
| 2 | 564 | |
| 4 | 743 | |
| 5 | 698 | |
| 6 | 829 | |
| 8 | 765 | |
| 9 | 714 | |
| 10 | 570 | |
| 11 | 522 | |
| 12 | 814 |
| Value | Count | Frequency (%) |
| 78 | 193 | 0.5% |
| 72 | 583 | |
| 66 | 233 | 0.6% |
| 56 | 536 | |
| 55 | 776 | |
| 54 | 665 | |
| 53 | 794 | |
| 51 | 792 | |
| 50 | 584 | |
| 49 | 740 |
area_title
Text
| Distinct | 54 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 12 |
| Mean length | 8.6317371 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | alabama |
|---|---|
| 2nd row | alabama |
| 3rd row | alabama |
| 4th row | alabama |
| 5th row | alabama |
| Value | Count | Frequency (%) |
| new | 2914 | 6.3% |
| carolina | 1554 | 3.4% |
| virginia | 1457 | 3.1% |
| north | 1383 | 3.0% |
| south | 1362 | 2.9% |
| dakota | 1191 | 2.6% |
| california | 829 | 1.8% |
| texas | 827 | 1.8% |
| pennsylvania | 818 | 1.8% |
| york | 817 | 1.8% |
| Other values (50) | 33175 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 44318 | |
| i | 35100 | |
| n | 31976 | 9.8% |
| o | 28324 | 8.7% |
| s | 24201 | 7.5% |
| e | 20680 | 6.4% |
| r | 17894 | 5.5% |
| t | 15034 | 4.6% |
| l | 11719 | 3.6% |
| c | 10667 | 3.3% |
| Other values (16) | 84718 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 324631 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 44318 | |
| i | 35100 | |
| n | 31976 | 9.8% |
| o | 28324 | 8.7% |
| s | 24201 | 7.5% |
| e | 20680 | 6.4% |
| r | 17894 | 5.5% |
| t | 15034 | 4.6% |
| l | 11719 | 3.6% |
| c | 10667 | 3.3% |
| Other values (16) | 84718 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 324631 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 44318 | |
| i | 35100 | |
| n | 31976 | 9.8% |
| o | 28324 | 8.7% |
| s | 24201 | 7.5% |
| e | 20680 | 6.4% |
| r | 17894 | 5.5% |
| t | 15034 | 4.6% |
| l | 11719 | 3.6% |
| c | 10667 | 3.3% |
| Other values (16) | 84718 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 324631 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 44318 | |
| i | 35100 | |
| n | 31976 | 9.8% |
| o | 28324 | 8.7% |
| s | 24201 | 7.5% |
| e | 20680 | 6.4% |
| r | 17894 | 5.5% |
| t | 15034 | 4.6% |
| l | 11719 | 3.6% |
| c | 10667 | 3.3% |
| Other values (16) | 84718 |
area_type
Categorical
High correlation Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
| 2 | |
|---|---|
| 3 | 1009 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 36600 | |
| 3 | 1009 | 2.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 36600 | |
| 3 | 1009 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 36600 | |
| 3 | 1009 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 37609 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 36600 | |
| 3 | 1009 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 37609 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 36600 | |
| 3 | 1009 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 37609 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 36600 | |
| 3 | 1009 | 2.7% |
prim_state
Text
| Distinct | 54 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | al |
|---|---|
| 2nd row | al |
| 3rd row | al |
| 4th row | al |
| 5th row | al |
| Value | Count | Frequency (%) |
| ca | 829 | 2.2% |
| tx | 827 | 2.2% |
| pa | 818 | 2.2% |
| ny | 817 | 2.2% |
| fl | 814 | 2.2% |
| oh | 810 | 2.2% |
| mi | 806 | 2.1% |
| nc | 795 | 2.1% |
| wa | 794 | 2.1% |
| va | 792 | 2.1% |
| Other values (44) | 29507 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8980 | 11.9% |
| n | 8017 | 10.7% |
| m | 6519 | 8.7% |
| i | 5910 | 7.9% |
| c | 4384 | 5.8% |
| t | 4282 | 5.7% |
| o | 3849 | 5.1% |
| d | 3716 | 4.9% |
| l | 3054 | 4.1% |
| v | 2937 | 3.9% |
| Other values (14) | 23570 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 75218 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 8980 | 11.9% |
| n | 8017 | 10.7% |
| m | 6519 | 8.7% |
| i | 5910 | 7.9% |
| c | 4384 | 5.8% |
| t | 4282 | 5.7% |
| o | 3849 | 5.1% |
| d | 3716 | 4.9% |
| l | 3054 | 4.1% |
| v | 2937 | 3.9% |
| Other values (14) | 23570 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 75218 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 8980 | 11.9% |
| n | 8017 | 10.7% |
| m | 6519 | 8.7% |
| i | 5910 | 7.9% |
| c | 4384 | 5.8% |
| t | 4282 | 5.7% |
| o | 3849 | 5.1% |
| d | 3716 | 4.9% |
| l | 3054 | 4.1% |
| v | 2937 | 3.9% |
| Other values (14) | 23570 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 75218 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 8980 | 11.9% |
| n | 8017 | 10.7% |
| m | 6519 | 8.7% |
| i | 5910 | 7.9% |
| c | 4384 | 5.8% |
| t | 4282 | 5.7% |
| o | 3849 | 5.1% |
| d | 3716 | 4.9% |
| l | 3054 | 4.1% |
| v | 2937 | 3.9% |
| Other values (14) | 23570 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 37609 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 37609 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 37609 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 37609 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 37609 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 37609 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 37609 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 37609 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 37609 |
naics_title
Categorical
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.5 MiB |
| cross-industry |
|---|
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | cross-industry |
|---|---|
| 2nd row | cross-industry |
| 3rd row | cross-industry |
| 4th row | cross-industry |
| 5th row | cross-industry |
Common Values
| Value | Count | Frequency (%) |
| cross-industry | 37609 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cross-industry | 37609 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 112827 | |
| r | 75218 | |
| c | 37609 | 7.1% |
| o | 37609 | 7.1% |
| - | 37609 | 7.1% |
| i | 37609 | 7.1% |
| n | 37609 | 7.1% |
| d | 37609 | 7.1% |
| u | 37609 | 7.1% |
| t | 37609 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 526526 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| s | 112827 | |
| r | 75218 | |
| c | 37609 | 7.1% |
| o | 37609 | 7.1% |
| - | 37609 | 7.1% |
| i | 37609 | 7.1% |
| n | 37609 | 7.1% |
| d | 37609 | 7.1% |
| u | 37609 | 7.1% |
| t | 37609 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 526526 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| s | 112827 | |
| r | 75218 | |
| c | 37609 | 7.1% |
| o | 37609 | 7.1% |
| - | 37609 | 7.1% |
| i | 37609 | 7.1% |
| n | 37609 | 7.1% |
| d | 37609 | 7.1% |
| u | 37609 | 7.1% |
| t | 37609 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 526526 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| s | 112827 | |
| r | 75218 | |
| c | 37609 | 7.1% |
| o | 37609 | 7.1% |
| - | 37609 | 7.1% |
| i | 37609 | 7.1% |
| n | 37609 | 7.1% |
| d | 37609 | 7.1% |
| u | 37609 | 7.1% |
| t | 37609 | 7.1% |
i_group
Categorical
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.5 MiB |
| cross-industry |
|---|
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | cross-industry |
|---|---|
| 2nd row | cross-industry |
| 3rd row | cross-industry |
| 4th row | cross-industry |
| 5th row | cross-industry |
Common Values
| Value | Count | Frequency (%) |
| cross-industry | 37609 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cross-industry | 37609 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 112827 | |
| r | 75218 | |
| c | 37609 | 7.1% |
| o | 37609 | 7.1% |
| - | 37609 | 7.1% |
| i | 37609 | 7.1% |
| n | 37609 | 7.1% |
| d | 37609 | 7.1% |
| u | 37609 | 7.1% |
| t | 37609 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 526526 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| s | 112827 | |
| r | 75218 | |
| c | 37609 | 7.1% |
| o | 37609 | 7.1% |
| - | 37609 | 7.1% |
| i | 37609 | 7.1% |
| n | 37609 | 7.1% |
| d | 37609 | 7.1% |
| u | 37609 | 7.1% |
| t | 37609 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 526526 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| s | 112827 | |
| r | 75218 | |
| c | 37609 | 7.1% |
| o | 37609 | 7.1% |
| - | 37609 | 7.1% |
| i | 37609 | 7.1% |
| n | 37609 | 7.1% |
| d | 37609 | 7.1% |
| u | 37609 | 7.1% |
| t | 37609 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 526526 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| s | 112827 | |
| r | 75218 | |
| c | 37609 | 7.1% |
| o | 37609 | 7.1% |
| - | 37609 | 7.1% |
| i | 37609 | 7.1% |
| n | 37609 | 7.1% |
| d | 37609 | 7.1% |
| u | 37609 | 7.1% |
| t | 37609 | 7.1% |
own_code
Categorical
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| 1235 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1235 |
|---|---|
| 2nd row | 1235 |
| 3rd row | 1235 |
| 4th row | 1235 |
| 5th row | 1235 |
Common Values
| Value | Count | Frequency (%) |
| 1235 | 37609 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1235 | 37609 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 37609 | |
| 2 | 37609 | |
| 3 | 37609 | |
| 5 | 37609 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 150436 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 37609 | |
| 2 | 37609 | |
| 3 | 37609 | |
| 5 | 37609 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 150436 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 37609 | |
| 2 | 37609 | |
| 3 | 37609 | |
| 5 | 37609 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 150436 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 37609 | |
| 2 | 37609 | |
| 3 | 37609 | |
| 5 | 37609 |
occ_code
Text
| Distinct | 854 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 00-0000 |
|---|---|
| 2nd row | 11-0000 |
| 3rd row | 11-1011 |
| 4th row | 11-1021 |
| 5th row | 11-1031 |
| Value | Count | Frequency (%) |
| 00-0000 | 54 | 0.1% |
| 43-4081 | 54 | 0.1% |
| 43-0000 | 54 | 0.1% |
| 43-3011 | 54 | 0.1% |
| 43-3021 | 54 | 0.1% |
| 43-3031 | 54 | 0.1% |
| 43-3051 | 54 | 0.1% |
| 43-4051 | 54 | 0.1% |
| 43-4171 | 54 | 0.1% |
| 43-9021 | 54 | 0.1% |
| Other values (844) | 37069 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 57208 | |
| - | 37609 | |
| 0 | 35050 | |
| 2 | 33492 | |
| 3 | 26635 | |
| 9 | 23264 | |
| 4 | 17815 | 6.8% |
| 5 | 17356 | 6.6% |
| 7 | 9404 | 3.6% |
| 6 | 3547 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 263263 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 57208 | |
| - | 37609 | |
| 0 | 35050 | |
| 2 | 33492 | |
| 3 | 26635 | |
| 9 | 23264 | |
| 4 | 17815 | 6.8% |
| 5 | 17356 | 6.6% |
| 7 | 9404 | 3.6% |
| 6 | 3547 | 1.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 263263 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 57208 | |
| - | 37609 | |
| 0 | 35050 | |
| 2 | 33492 | |
| 3 | 26635 | |
| 9 | 23264 | |
| 4 | 17815 | 6.8% |
| 5 | 17356 | 6.6% |
| 7 | 9404 | 3.6% |
| 6 | 3547 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 263263 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 57208 | |
| - | 37609 | |
| 0 | 35050 | |
| 2 | 33492 | |
| 3 | 26635 | |
| 9 | 23264 | |
| 4 | 17815 | 6.8% |
| 5 | 17356 | 6.6% |
| 7 | 9404 | 3.6% |
| 6 | 3547 | 1.3% |
occ_title
Text
| Distinct | 854 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.3 MiB |
Length
| Max length | 112 |
|---|---|
| Median length | 75 |
| Mean length | 35.15725 |
| Min length | 6 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | all occupations |
|---|---|
| 2nd row | management occupations |
| 3rd row | chief executives |
| 4th row | general and operations managers |
| 5th row | legislators |
| Value | Count | Frequency (%) |
| and | 21836 | 13.7% |
| workers | 3923 | 2.5% |
| other | 3170 | 2.0% |
| operators | 3125 | 2.0% |
| all | 2957 | 1.9% |
| teachers | 2390 | 1.5% |
| technicians | 2360 | 1.5% |
| except | 1873 | 1.2% |
| postsecondary | 1793 | 1.1% |
| machine | 1774 | 1.1% |
| Other values (1097) | 114080 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 134227 | |
| 121672 | 9.2% | |
| a | 113041 | 8.5% |
| s | 112517 | 8.5% |
| r | 104784 | 7.9% |
| n | 95406 | 7.2% |
| i | 91936 | 7.0% |
| t | 90408 | 6.8% |
| o | 72084 | 5.5% |
| c | 66741 | 5.0% |
| Other values (21) | 319413 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1322229 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 134227 | |
| 121672 | 9.2% | |
| a | 113041 | 8.5% |
| s | 112517 | 8.5% |
| r | 104784 | 7.9% |
| n | 95406 | 7.2% |
| i | 91936 | 7.0% |
| t | 90408 | 6.8% |
| o | 72084 | 5.5% |
| c | 66741 | 5.0% |
| Other values (21) | 319413 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1322229 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 134227 | |
| 121672 | 9.2% | |
| a | 113041 | 8.5% |
| s | 112517 | 8.5% |
| r | 104784 | 7.9% |
| n | 95406 | 7.2% |
| i | 91936 | 7.0% |
| t | 90408 | 6.8% |
| o | 72084 | 5.5% |
| c | 66741 | 5.0% |
| Other values (21) | 319413 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1322229 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 134227 | |
| 121672 | 9.2% | |
| a | 113041 | 8.5% |
| s | 112517 | 8.5% |
| r | 104784 | 7.9% |
| n | 95406 | 7.2% |
| i | 91936 | 7.0% |
| t | 90408 | 6.8% |
| o | 72084 | 5.5% |
| c | 66741 | 5.0% |
| Other values (21) | 319413 |
o_group
Categorical
Imbalance
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| detailed | |
|---|---|
| major | 1188 |
| total | 54 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.900928 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | total |
|---|---|
| 2nd row | major |
| 3rd row | detailed |
| 4th row | detailed |
| 5th row | detailed |
Common Values
| Value | Count | Frequency (%) |
| detailed | 36367 | |
| major | 1188 | 3.2% |
| total | 54 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| detailed | 36367 | |
| major | 1188 | 3.2% |
| total | 54 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 72734 | |
| e | 72734 | |
| a | 37609 | |
| t | 36475 | |
| l | 36421 | |
| i | 36367 | |
| o | 1242 | 0.4% |
| m | 1188 | 0.4% |
| j | 1188 | 0.4% |
| r | 1188 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 297146 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| d | 72734 | |
| e | 72734 | |
| a | 37609 | |
| t | 36475 | |
| l | 36421 | |
| i | 36367 | |
| o | 1242 | 0.4% |
| m | 1188 | 0.4% |
| j | 1188 | 0.4% |
| r | 1188 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 297146 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| d | 72734 | |
| e | 72734 | |
| a | 37609 | |
| t | 36475 | |
| l | 36421 | |
| i | 36367 | |
| o | 1242 | 0.4% |
| m | 1188 | 0.4% |
| j | 1188 | 0.4% |
| r | 1188 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 297146 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| d | 72734 | |
| e | 72734 | |
| a | 37609 | |
| t | 36475 | |
| l | 36421 | |
| i | 36367 | |
| o | 1242 | 0.4% |
| m | 1188 | 0.4% |
| j | 1188 | 0.4% |
| r | 1188 | 0.4% |
tot_emp
Text
| Distinct | 3995 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 3.4316254 |
| Min length | 2 |
Unique
| Unique | 2242 ? |
|---|---|
| Unique (%) | 6.0% |
Sample
| 1st row | 2091480 |
|---|---|
| 2nd row | 110240 |
| 3rd row | 830 |
| 4th row | 32370 |
| 5th row | 1120 |
| Value | Count | Frequency (%) |
| 1353 | 3.6% | |
| 40 | 688 | 1.8% |
| 60 | 623 | 1.7% |
| 70 | 612 | 1.6% |
| 50 | 608 | 1.6% |
| 90 | 538 | 1.4% |
| 80 | 491 | 1.3% |
| 110 | 489 | 1.3% |
| 100 | 482 | 1.3% |
| 130 | 449 | 1.2% |
| Other values (3985) | 31276 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 42318 | |
| 1 | 16774 | 13.0% |
| 2 | 11629 | 9.0% |
| 3 | 10019 | 7.8% |
| 4 | 9192 | 7.1% |
| 5 | 8312 | 6.4% |
| 6 | 7778 | 6.0% |
| 7 | 7145 | 5.5% |
| 8 | 6804 | 5.3% |
| 9 | 6383 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 129060 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 42318 | |
| 1 | 16774 | 13.0% |
| 2 | 11629 | 9.0% |
| 3 | 10019 | 7.8% |
| 4 | 9192 | 7.1% |
| 5 | 8312 | 6.4% |
| 6 | 7778 | 6.0% |
| 7 | 7145 | 5.5% |
| 8 | 6804 | 5.3% |
| 9 | 6383 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 129060 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 42318 | |
| 1 | 16774 | 13.0% |
| 2 | 11629 | 9.0% |
| 3 | 10019 | 7.8% |
| 4 | 9192 | 7.1% |
| 5 | 8312 | 6.4% |
| 6 | 7778 | 6.0% |
| 7 | 7145 | 5.5% |
| 8 | 6804 | 5.3% |
| 9 | 6383 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 129060 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 42318 | |
| 1 | 16774 | 13.0% |
| 2 | 11629 | 9.0% |
| 3 | 10019 | 7.8% |
| 4 | 9192 | 7.1% |
| 5 | 8312 | 6.4% |
| 6 | 7778 | 6.0% |
| 7 | 7145 | 5.5% |
| 8 | 6804 | 5.3% |
| 9 | 6383 | 4.9% |
emp_prse
Text
| Distinct | 501 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.2187242 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0.9 |
| 3rd row | 17.6 |
| 4th row | 1.6 |
| 5th row | 8.7 |
| Value | Count | Frequency (%) |
| 1353 | 3.6% | |
| 0 | 380 | 1.0% |
| 6.5 | 241 | 0.6% |
| 4 | 239 | 0.6% |
| 4.7 | 232 | 0.6% |
| 2.5 | 230 | 0.6% |
| 3.4 | 230 | 0.6% |
| 5.1 | 229 | 0.6% |
| 4.9 | 226 | 0.6% |
| 6.9 | 226 | 0.6% |
| Other values (491) | 34023 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 32226 | |
| 1 | 17800 | |
| 2 | 11961 | 9.9% |
| 3 | 9711 | 8.0% |
| 4 | 8523 | 7.0% |
| 5 | 7368 | 6.1% |
| 6 | 7211 | 6.0% |
| 7 | 6878 | 5.7% |
| 8 | 6618 | 5.5% |
| 9 | 6255 | 5.2% |
| Other values (2) | 6502 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 121053 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 32226 | |
| 1 | 17800 | |
| 2 | 11961 | 9.9% |
| 3 | 9711 | 8.0% |
| 4 | 8523 | 7.0% |
| 5 | 7368 | 6.1% |
| 6 | 7211 | 6.0% |
| 7 | 6878 | 5.7% |
| 8 | 6618 | 5.5% |
| 9 | 6255 | 5.2% |
| Other values (2) | 6502 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 121053 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 32226 | |
| 1 | 17800 | |
| 2 | 11961 | 9.9% |
| 3 | 9711 | 8.0% |
| 4 | 8523 | 7.0% |
| 5 | 7368 | 6.1% |
| 6 | 7211 | 6.0% |
| 7 | 6878 | 5.7% |
| 8 | 6618 | 5.5% |
| 9 | 6255 | 5.2% |
| Other values (2) | 6502 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 121053 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 32226 | |
| 1 | 17800 | |
| 2 | 11961 | 9.9% |
| 3 | 9711 | 8.0% |
| 4 | 8523 | 7.0% |
| 5 | 7368 | 6.1% |
| 6 | 7211 | 6.0% |
| 7 | 6878 | 5.7% |
| 8 | 6618 | 5.5% |
| 9 | 6255 | 5.2% |
| Other values (2) | 6502 | 5.4% |
jobs_1000
Text
| Distinct | 7322 |
|---|---|
| Distinct (%) | 19.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 4.8407828 |
| Min length | 1 |
Unique
| Unique | 4164 ? |
|---|---|
| Unique (%) | 11.1% |
Sample
| 1st row | 1000 |
|---|---|
| 2nd row | 52.708 |
| 3rd row | 0.396 |
| 4th row | 15.476 |
| 5th row | 0.533 |
| Value | Count | Frequency (%) |
| 1353 | 3.6% | |
| 0.029 | 106 | 0.3% |
| 0.022 | 104 | 0.3% |
| 0.052 | 95 | 0.3% |
| 0.075 | 93 | 0.2% |
| 0.037 | 92 | 0.2% |
| 0.057 | 91 | 0.2% |
| 0.069 | 88 | 0.2% |
| 0.063 | 86 | 0.2% |
| 0.048 | 86 | 0.2% |
| Other values (7312) | 35415 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 36492 | |
| . | 36185 | |
| 1 | 19482 | |
| 2 | 14888 | |
| 3 | 12527 | 6.9% |
| 4 | 11093 | 6.1% |
| 5 | 10590 | 5.8% |
| 6 | 9984 | 5.5% |
| 7 | 9783 | 5.4% |
| 8 | 9334 | 5.1% |
| Other values (2) | 11699 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 182057 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 36492 | |
| . | 36185 | |
| 1 | 19482 | |
| 2 | 14888 | |
| 3 | 12527 | 6.9% |
| 4 | 11093 | 6.1% |
| 5 | 10590 | 5.8% |
| 6 | 9984 | 5.5% |
| 7 | 9783 | 5.4% |
| 8 | 9334 | 5.1% |
| Other values (2) | 11699 | 6.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 182057 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 36492 | |
| . | 36185 | |
| 1 | 19482 | |
| 2 | 14888 | |
| 3 | 12527 | 6.9% |
| 4 | 11093 | 6.1% |
| 5 | 10590 | 5.8% |
| 6 | 9984 | 5.5% |
| 7 | 9783 | 5.4% |
| 8 | 9334 | 5.1% |
| Other values (2) | 11699 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 182057 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 36492 | |
| . | 36185 | |
| 1 | 19482 | |
| 2 | 14888 | |
| 3 | 12527 | 6.9% |
| 4 | 11093 | 6.1% |
| 5 | 10590 | 5.8% |
| 6 | 9984 | 5.5% |
| 7 | 9783 | 5.4% |
| 8 | 9334 | 5.1% |
| Other values (2) | 11699 | 6.4% |
loc_quotient
Text
| Distinct | 852 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 3.8118004 |
| Min length | 1 |
Unique
| Unique | 283 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0.74 |
| 3rd row | 0.29 |
| 4th row | 0.67 |
| 5th row | 3.1 |
| Value | Count | Frequency (%) |
| 1353 | 3.6% | |
| 0.92 | 356 | 0.9% |
| 0.89 | 355 | 0.9% |
| 0.93 | 353 | 0.9% |
| 0.96 | 352 | 0.9% |
| 1 | 351 | 0.9% |
| 0.91 | 350 | 0.9% |
| 0.94 | 347 | 0.9% |
| 0.87 | 347 | 0.9% |
| 1.02 | 344 | 0.9% |
| Other values (842) | 33101 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 35846 | |
| 0 | 23313 | |
| 1 | 20261 | |
| 2 | 9008 | 6.3% |
| 9 | 7666 | 5.3% |
| 3 | 7656 | 5.3% |
| 8 | 7533 | 5.3% |
| 7 | 7492 | 5.2% |
| 6 | 7376 | 5.1% |
| 4 | 7257 | 5.1% |
| Other values (2) | 9950 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 143358 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 35846 | |
| 0 | 23313 | |
| 1 | 20261 | |
| 2 | 9008 | 6.3% |
| 9 | 7666 | 5.3% |
| 3 | 7656 | 5.3% |
| 8 | 7533 | 5.3% |
| 7 | 7492 | 5.2% |
| 6 | 7376 | 5.1% |
| 4 | 7257 | 5.1% |
| Other values (2) | 9950 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 143358 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 35846 | |
| 0 | 23313 | |
| 1 | 20261 | |
| 2 | 9008 | 6.3% |
| 9 | 7666 | 5.3% |
| 3 | 7656 | 5.3% |
| 8 | 7533 | 5.3% |
| 7 | 7492 | 5.2% |
| 6 | 7376 | 5.1% |
| 4 | 7257 | 5.1% |
| Other values (2) | 9950 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 143358 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 35846 | |
| 0 | 23313 | |
| 1 | 20261 | |
| 2 | 9008 | 6.3% |
| 9 | 7666 | 5.3% |
| 3 | 7656 | 5.3% |
| 8 | 7533 | 5.3% |
| 7 | 7492 | 5.2% |
| 6 | 7376 | 5.1% |
| 4 | 7257 | 5.1% |
| Other values (2) | 9950 | 6.9% |
pct_total
Unsupported
Missing Rejected Unsupported
| Missing | 37609 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 293.9 KiB |
pct_rpt
Unsupported
Missing Rejected Unsupported
| Missing | 37609 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 293.9 KiB |
h_mean
Text
| Distinct | 6713 |
|---|---|
| Distinct (%) | 17.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.573985 |
| Min length | 1 |
Unique
| Unique | 2070 ? |
|---|---|
| Unique (%) | 5.5% |
Sample
| 1st row | 26.61 |
|---|---|
| 2nd row | 57.05 |
| 3rd row | 99.61 |
| 4th row | 64.8 |
| 5th row | * |
| Value | Count | Frequency (%) |
| 3128 | 8.3% | |
| 22.35 | 27 | 0.1% |
| 21.14 | 27 | 0.1% |
| 20.64 | 26 | 0.1% |
| 24.47 | 26 | 0.1% |
| 20.95 | 25 | 0.1% |
| 24.4 | 25 | 0.1% |
| 21.85 | 25 | 0.1% |
| 20.38 | 25 | 0.1% |
| 22.67 | 25 | 0.1% |
| Other values (6702) | 34250 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 34131 | |
| 2 | 23689 | |
| 3 | 17232 | |
| 1 | 17205 | |
| 4 | 14332 | |
| 5 | 12033 | 7.0% |
| 6 | 11453 | 6.7% |
| 7 | 10844 | 6.3% |
| 8 | 10746 | 6.2% |
| 9 | 10545 | 6.1% |
| Other values (3) | 9813 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 172023 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 34131 | |
| 2 | 23689 | |
| 3 | 17232 | |
| 1 | 17205 | |
| 4 | 14332 | |
| 5 | 12033 | 7.0% |
| 6 | 11453 | 6.7% |
| 7 | 10844 | 6.3% |
| 8 | 10746 | 6.2% |
| 9 | 10545 | 6.1% |
| Other values (3) | 9813 | 5.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 172023 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 34131 | |
| 2 | 23689 | |
| 3 | 17232 | |
| 1 | 17205 | |
| 4 | 14332 | |
| 5 | 12033 | 7.0% |
| 6 | 11453 | 6.7% |
| 7 | 10844 | 6.3% |
| 8 | 10746 | 6.2% |
| 9 | 10545 | 6.1% |
| Other values (3) | 9813 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 172023 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 34131 | |
| 2 | 23689 | |
| 3 | 17232 | |
| 1 | 17205 | |
| 4 | 14332 | |
| 5 | 12033 | 7.0% |
| 6 | 11453 | 6.7% |
| 7 | 10844 | 6.3% |
| 8 | 10746 | 6.2% |
| 9 | 10545 | 6.1% |
| Other values (3) | 9813 | 5.7% |
a_mean
Text
| Distinct | 11304 |
|---|---|
| Distinct (%) | 30.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.0847669 |
| Min length | 1 |
Unique
| Unique | 3901 ? |
|---|---|
| Unique (%) | 10.4% |
Sample
| 1st row | 55350 |
|---|---|
| 2nd row | 118670 |
| 3rd row | 207190 |
| 4th row | 134790 |
| 5th row | 36570 |
| Value | Count | Frequency (%) |
| 635 | 1.7% | |
| 48570 | 16 | < 0.1% |
| 39430 | 16 | < 0.1% |
| 45900 | 15 | < 0.1% |
| 50760 | 15 | < 0.1% |
| 56050 | 15 | < 0.1% |
| 58350 | 15 | < 0.1% |
| 52200 | 15 | < 0.1% |
| 44110 | 15 | < 0.1% |
| 45280 | 15 | < 0.1% |
| Other values (11293) | 36837 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 49672 | |
| 4 | 18772 | 9.8% |
| 5 | 17659 | 9.2% |
| 1 | 17251 | 9.0% |
| 6 | 16042 | 8.4% |
| 3 | 15884 | 8.3% |
| 7 | 14802 | 7.7% |
| 8 | 13773 | 7.2% |
| 2 | 13387 | 7.0% |
| 9 | 13356 | 7.0% |
| Other values (2) | 635 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 191233 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49672 | |
| 4 | 18772 | 9.8% |
| 5 | 17659 | 9.2% |
| 1 | 17251 | 9.0% |
| 6 | 16042 | 8.4% |
| 3 | 15884 | 8.3% |
| 7 | 14802 | 7.7% |
| 8 | 13773 | 7.2% |
| 2 | 13387 | 7.0% |
| 9 | 13356 | 7.0% |
| Other values (2) | 635 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 191233 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49672 | |
| 4 | 18772 | 9.8% |
| 5 | 17659 | 9.2% |
| 1 | 17251 | 9.0% |
| 6 | 16042 | 8.4% |
| 3 | 15884 | 8.3% |
| 7 | 14802 | 7.7% |
| 8 | 13773 | 7.2% |
| 2 | 13387 | 7.0% |
| 9 | 13356 | 7.0% |
| Other values (2) | 635 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 191233 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49672 | |
| 4 | 18772 | 9.8% |
| 5 | 17659 | 9.2% |
| 1 | 17251 | 9.0% |
| 6 | 16042 | 8.4% |
| 3 | 15884 | 8.3% |
| 7 | 14802 | 7.7% |
| 8 | 13773 | 7.2% |
| 2 | 13387 | 7.0% |
| 9 | 13356 | 7.0% |
| Other values (2) | 635 | 0.3% |
mean_prse
Text
| Distinct | 293 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 2.8386025 |
| Min length | 1 |
Unique
| Unique | 19 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 0.3 |
|---|---|
| 2nd row | 0.7 |
| 3rd row | 4.6 |
| 4th row | 1.6 |
| 5th row | 3.8 |
| Value | Count | Frequency (%) |
| 1.1 | 1046 | 2.8% |
| 1.4 | 993 | 2.6% |
| 0.9 | 991 | 2.6% |
| 0.8 | 991 | 2.6% |
| 1.2 | 985 | 2.6% |
| 0.7 | 981 | 2.6% |
| 1.3 | 940 | 2.5% |
| 1 | 918 | 2.4% |
| 1.6 | 882 | 2.3% |
| 1.5 | 878 | 2.3% |
| Other values (283) | 28004 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 33312 | |
| 1 | 15445 | |
| 2 | 10926 | 10.2% |
| 3 | 8148 | 7.6% |
| 4 | 7027 | 6.6% |
| 0 | 6144 | 5.8% |
| 5 | 5901 | 5.5% |
| 6 | 5458 | 5.1% |
| 7 | 5028 | 4.7% |
| 8 | 4668 | 4.4% |
| Other values (2) | 4700 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 106757 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 33312 | |
| 1 | 15445 | |
| 2 | 10926 | 10.2% |
| 3 | 8148 | 7.6% |
| 4 | 7027 | 6.6% |
| 0 | 6144 | 5.8% |
| 5 | 5901 | 5.5% |
| 6 | 5458 | 5.1% |
| 7 | 5028 | 4.7% |
| 8 | 4668 | 4.4% |
| Other values (2) | 4700 | 4.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 106757 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 33312 | |
| 1 | 15445 | |
| 2 | 10926 | 10.2% |
| 3 | 8148 | 7.6% |
| 4 | 7027 | 6.6% |
| 0 | 6144 | 5.8% |
| 5 | 5901 | 5.5% |
| 6 | 5458 | 5.1% |
| 7 | 5028 | 4.7% |
| 8 | 4668 | 4.4% |
| Other values (2) | 4700 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 106757 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 33312 | |
| 1 | 15445 | |
| 2 | 10926 | 10.2% |
| 3 | 8148 | 7.6% |
| 4 | 7027 | 6.6% |
| 0 | 6144 | 5.8% |
| 5 | 5901 | 5.5% |
| 6 | 5458 | 5.1% |
| 7 | 5028 | 4.7% |
| 8 | 4668 | 4.4% |
| Other values (2) | 4700 | 4.4% |
h_pct10
Text
| Distinct | 4259 |
|---|---|
| Distinct (%) | 11.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.4513813 |
| Min length | 1 |
Unique
| Unique | 1054 ? |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | 11.31 |
|---|---|
| 2nd row | 24.57 |
| 3rd row | 50.46 |
| 4th row | 24.24 |
| 5th row | * |
| Value | Count | Frequency (%) |
| 3128 | 8.3% | |
| 15 | 256 | 0.7% |
| 14 | 172 | 0.5% |
| 17 | 123 | 0.3% |
| 12 | 116 | 0.3% |
| 9.5 | 115 | 0.3% |
| 15.13 | 113 | 0.3% |
| 20.48 | 90 | 0.2% |
| 15.69 | 85 | 0.2% |
| 18 | 83 | 0.2% |
| Other values (4248) | 33328 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 33129 | |
| 1 | 28703 | |
| 2 | 19662 | |
| 3 | 13589 | |
| 4 | 11188 | 6.7% |
| 7 | 10862 | 6.5% |
| 8 | 10713 | 6.4% |
| 5 | 10530 | 6.3% |
| 6 | 10261 | 6.1% |
| 9 | 9622 | 5.7% |
| Other values (3) | 9153 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 167412 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 33129 | |
| 1 | 28703 | |
| 2 | 19662 | |
| 3 | 13589 | |
| 4 | 11188 | 6.7% |
| 7 | 10862 | 6.5% |
| 8 | 10713 | 6.4% |
| 5 | 10530 | 6.3% |
| 6 | 10261 | 6.1% |
| 9 | 9622 | 5.7% |
| Other values (3) | 9153 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 167412 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 33129 | |
| 1 | 28703 | |
| 2 | 19662 | |
| 3 | 13589 | |
| 4 | 11188 | 6.7% |
| 7 | 10862 | 6.5% |
| 8 | 10713 | 6.4% |
| 5 | 10530 | 6.3% |
| 6 | 10261 | 6.1% |
| 9 | 9622 | 5.7% |
| Other values (3) | 9153 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 167412 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 33129 | |
| 1 | 28703 | |
| 2 | 19662 | |
| 3 | 13589 | |
| 4 | 11188 | 6.7% |
| 7 | 10862 | 6.5% |
| 8 | 10713 | 6.4% |
| 5 | 10530 | 6.3% |
| 6 | 10261 | 6.1% |
| 9 | 9622 | 5.7% |
| Other values (3) | 9153 | 5.5% |
h_pct25
Text
| Distinct | 5028 |
|---|---|
| Distinct (%) | 13.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.4836342 |
| Min length | 1 |
Unique
| Unique | 1315 ? |
|---|---|
| Unique (%) | 3.5% |
Sample
| 1st row | 14.74 |
|---|---|
| 2nd row | 35.03 |
| 3rd row | 62.96 |
| 4th row | 35.92 |
| 5th row | * |
| Value | Count | Frequency (%) |
| 3277 | 8.7% | |
| 15 | 118 | 0.3% |
| 14 | 77 | 0.2% |
| 18 | 75 | 0.2% |
| 17 | 71 | 0.2% |
| 20 | 58 | 0.2% |
| 19.02 | 57 | 0.2% |
| 25 | 56 | 0.1% |
| 21 | 53 | 0.1% |
| 22 | 51 | 0.1% |
| Other values (5017) | 33716 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 33347 | |
| 1 | 23714 | |
| 2 | 22336 | |
| 3 | 14946 | |
| 4 | 11627 | 6.9% |
| 7 | 11280 | 6.7% |
| 8 | 11150 | 6.6% |
| 5 | 10403 | 6.2% |
| 6 | 10206 | 6.1% |
| 9 | 10048 | 6.0% |
| Other values (3) | 9568 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 168625 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 33347 | |
| 1 | 23714 | |
| 2 | 22336 | |
| 3 | 14946 | |
| 4 | 11627 | 6.9% |
| 7 | 11280 | 6.7% |
| 8 | 11150 | 6.6% |
| 5 | 10403 | 6.2% |
| 6 | 10206 | 6.1% |
| 9 | 10048 | 6.0% |
| Other values (3) | 9568 | 5.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 168625 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 33347 | |
| 1 | 23714 | |
| 2 | 22336 | |
| 3 | 14946 | |
| 4 | 11627 | 6.9% |
| 7 | 11280 | 6.7% |
| 8 | 11150 | 6.6% |
| 5 | 10403 | 6.2% |
| 6 | 10206 | 6.1% |
| 9 | 10048 | 6.0% |
| Other values (3) | 9568 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 168625 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 33347 | |
| 1 | 23714 | |
| 2 | 22336 | |
| 3 | 14946 | |
| 4 | 11627 | 6.9% |
| 7 | 11280 | 6.7% |
| 8 | 11150 | 6.6% |
| 5 | 10403 | 6.2% |
| 6 | 10206 | 6.1% |
| 9 | 10048 | 6.0% |
| Other values (3) | 9568 | 5.7% |
h_median
Text
| Distinct | 5914 |
|---|---|
| Distinct (%) | 15.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.4800181 |
| Min length | 1 |
Unique
| Unique | 1545 ? |
|---|---|
| Unique (%) | 4.1% |
Sample
| 1st row | 21.07 |
|---|---|
| 2nd row | 48.39 |
| 3rd row | 79.04 |
| 4th row | 51.12 |
| 5th row | * |
| Value | Count | Frequency (%) |
| 3561 | 9.5% | |
| 20 | 70 | 0.2% |
| 25 | 57 | 0.2% |
| 23.56 | 44 | 0.1% |
| 18 | 44 | 0.1% |
| 28.82 | 41 | 0.1% |
| 23 | 39 | 0.1% |
| 21 | 36 | 0.1% |
| 15 | 36 | 0.1% |
| 22 | 35 | 0.1% |
| Other values (5903) | 33646 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 33284 | |
| 2 | 23292 | |
| 1 | 18087 | |
| 3 | 16578 | |
| 4 | 12986 | 7.7% |
| 8 | 11227 | 6.7% |
| 7 | 11217 | 6.7% |
| 5 | 10807 | 6.4% |
| 6 | 10776 | 6.4% |
| 9 | 10151 | 6.0% |
| Other values (3) | 10084 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 168489 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 33284 | |
| 2 | 23292 | |
| 1 | 18087 | |
| 3 | 16578 | |
| 4 | 12986 | 7.7% |
| 8 | 11227 | 6.7% |
| 7 | 11217 | 6.7% |
| 5 | 10807 | 6.4% |
| 6 | 10776 | 6.4% |
| 9 | 10151 | 6.0% |
| Other values (3) | 10084 | 6.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 168489 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 33284 | |
| 2 | 23292 | |
| 1 | 18087 | |
| 3 | 16578 | |
| 4 | 12986 | 7.7% |
| 8 | 11227 | 6.7% |
| 7 | 11217 | 6.7% |
| 5 | 10807 | 6.4% |
| 6 | 10776 | 6.4% |
| 9 | 10151 | 6.0% |
| Other values (3) | 10084 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 168489 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 33284 | |
| 2 | 23292 | |
| 1 | 18087 | |
| 3 | 16578 | |
| 4 | 12986 | 7.7% |
| 8 | 11227 | 6.7% |
| 7 | 11217 | 6.7% |
| 5 | 10807 | 6.4% |
| 6 | 10776 | 6.4% |
| 9 | 10151 | 6.0% |
| Other values (3) | 10084 | 6.0% |
h_pct75
Text
| Distinct | 6888 |
|---|---|
| Distinct (%) | 18.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.4611396 |
| Min length | 1 |
Unique
| Unique | 1830 ? |
|---|---|
| Unique (%) | 4.9% |
Sample
| 1st row | 30.82 |
|---|---|
| 2nd row | 68.5 |
| 3rd row | 106.69 |
| 4th row | 78.26 |
| 5th row | * |
| Value | Count | Frequency (%) |
| 3830 | 10.2% | |
| 36.2 | 55 | 0.1% |
| 20 | 44 | 0.1% |
| 35.6 | 41 | 0.1% |
| 35.08 | 39 | 0.1% |
| 30 | 37 | 0.1% |
| 22.5 | 32 | 0.1% |
| 25 | 31 | 0.1% |
| 21 | 31 | 0.1% |
| 23 | 29 | 0.1% |
| Other values (6877) | 33440 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 33122 | |
| 2 | 21363 | |
| 3 | 17858 | |
| 4 | 14120 | |
| 1 | 13751 | |
| 5 | 12058 | 7.2% |
| 6 | 11824 | 7.0% |
| 7 | 11507 | 6.9% |
| 8 | 11309 | 6.7% |
| 9 | 10370 | 6.2% |
| Other values (3) | 10497 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 167779 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 33122 | |
| 2 | 21363 | |
| 3 | 17858 | |
| 4 | 14120 | |
| 1 | 13751 | |
| 5 | 12058 | 7.2% |
| 6 | 11824 | 7.0% |
| 7 | 11507 | 6.9% |
| 8 | 11309 | 6.7% |
| 9 | 10370 | 6.2% |
| Other values (3) | 10497 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 167779 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 33122 | |
| 2 | 21363 | |
| 3 | 17858 | |
| 4 | 14120 | |
| 1 | 13751 | |
| 5 | 12058 | 7.2% |
| 6 | 11824 | 7.0% |
| 7 | 11507 | 6.9% |
| 8 | 11309 | 6.7% |
| 9 | 10370 | 6.2% |
| Other values (3) | 10497 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 167779 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 33122 | |
| 2 | 21363 | |
| 3 | 17858 | |
| 4 | 14120 | |
| 1 | 13751 | |
| 5 | 12058 | 7.2% |
| 6 | 11824 | 7.0% |
| 7 | 11507 | 6.9% |
| 8 | 11309 | 6.7% |
| 9 | 10370 | 6.2% |
| Other values (3) | 10497 | 6.3% |
h_pct90
Text
| Distinct | 7760 |
|---|---|
| Distinct (%) | 20.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.4265468 |
| Min length | 1 |
Unique
| Unique | 1994 ? |
|---|---|
| Unique (%) | 5.3% |
Sample
| 1st row | 47.51 |
|---|---|
| 2nd row | 98.03 |
| 3rd row | # |
| 4th row | # |
| 5th row | * |
| Value | Count | Frequency (%) |
| 4310 | 11.5% | |
| 35.6 | 82 | 0.2% |
| 92.25 | 48 | 0.1% |
| 38.03 | 33 | 0.1% |
| 36.2 | 32 | 0.1% |
| 36.24 | 30 | 0.1% |
| 54.34 | 29 | 0.1% |
| 30 | 27 | 0.1% |
| 22 | 26 | 0.1% |
| 35 | 26 | 0.1% |
| Other values (7749) | 32966 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 32675 | |
| 2 | 18070 | |
| 3 | 17897 | |
| 4 | 14910 | |
| 5 | 12975 | 7.8% |
| 6 | 12483 | 7.5% |
| 1 | 12270 | 7.4% |
| 8 | 11652 | 7.0% |
| 7 | 11637 | 7.0% |
| 9 | 10615 | 6.4% |
| Other values (3) | 11294 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 166478 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 32675 | |
| 2 | 18070 | |
| 3 | 17897 | |
| 4 | 14910 | |
| 5 | 12975 | 7.8% |
| 6 | 12483 | 7.5% |
| 1 | 12270 | 7.4% |
| 8 | 11652 | 7.0% |
| 7 | 11637 | 7.0% |
| 9 | 10615 | 6.4% |
| Other values (3) | 11294 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 166478 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 32675 | |
| 2 | 18070 | |
| 3 | 17897 | |
| 4 | 14910 | |
| 5 | 12975 | 7.8% |
| 6 | 12483 | 7.5% |
| 1 | 12270 | 7.4% |
| 8 | 11652 | 7.0% |
| 7 | 11637 | 7.0% |
| 9 | 10615 | 6.4% |
| Other values (3) | 11294 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 166478 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 32675 | |
| 2 | 18070 | |
| 3 | 17897 | |
| 4 | 14910 | |
| 5 | 12975 | 7.8% |
| 6 | 12483 | 7.5% |
| 1 | 12270 | 7.4% |
| 8 | 11652 | 7.0% |
| 7 | 11637 | 7.0% |
| 9 | 10615 | 6.4% |
| Other values (3) | 11294 | 6.8% |
a_pct10
Text
| Distinct | 7333 |
|---|---|
| Distinct (%) | 19.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.953309 |
| Min length | 1 |
Unique
| Unique | 2083 ? |
|---|---|
| Unique (%) | 5.5% |
Sample
| 1st row | 23520 |
|---|---|
| 2nd row | 51100 |
| 3rd row | 104950 |
| 4th row | 50410 |
| 5th row | 18270 |
| Value | Count | Frequency (%) |
| 635 | 1.7% | |
| 31200 | 240 | 0.6% |
| 29120 | 169 | 0.4% |
| 24960 | 120 | 0.3% |
| 19760 | 119 | 0.3% |
| 31470 | 103 | 0.3% |
| 35360 | 97 | 0.3% |
| 42600 | 86 | 0.2% |
| 32640 | 83 | 0.2% |
| 37440 | 66 | 0.2% |
| Other values (7322) | 35891 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 48726 | |
| 3 | 22890 | |
| 4 | 18786 | 10.1% |
| 2 | 16832 | 9.0% |
| 5 | 14527 | 7.8% |
| 6 | 14231 | 7.6% |
| 1 | 12572 | 6.7% |
| 7 | 12503 | 6.7% |
| 9 | 12445 | 6.7% |
| 8 | 12142 | 6.5% |
| Other values (2) | 635 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 186289 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 48726 | |
| 3 | 22890 | |
| 4 | 18786 | 10.1% |
| 2 | 16832 | 9.0% |
| 5 | 14527 | 7.8% |
| 6 | 14231 | 7.6% |
| 1 | 12572 | 6.7% |
| 7 | 12503 | 6.7% |
| 9 | 12445 | 6.7% |
| 8 | 12142 | 6.5% |
| Other values (2) | 635 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 186289 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 48726 | |
| 3 | 22890 | |
| 4 | 18786 | 10.1% |
| 2 | 16832 | 9.0% |
| 5 | 14527 | 7.8% |
| 6 | 14231 | 7.6% |
| 1 | 12572 | 6.7% |
| 7 | 12503 | 6.7% |
| 9 | 12445 | 6.7% |
| 8 | 12142 | 6.5% |
| Other values (2) | 635 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 186289 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 48726 | |
| 3 | 22890 | |
| 4 | 18786 | 10.1% |
| 2 | 16832 | 9.0% |
| 5 | 14527 | 7.8% |
| 6 | 14231 | 7.6% |
| 1 | 12572 | 6.7% |
| 7 | 12503 | 6.7% |
| 9 | 12445 | 6.7% |
| 8 | 12142 | 6.5% |
| Other values (2) | 635 | 0.3% |
a_pct25
Text
| Distinct | 8572 |
|---|---|
| Distinct (%) | 22.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.9641043 |
| Min length | 1 |
Unique
| Unique | 2611 ? |
|---|---|
| Unique (%) | 6.9% |
Sample
| 1st row | 30660 |
|---|---|
| 2nd row | 72870 |
| 3rd row | 130950 |
| 4th row | 74720 |
| 5th row | 20950 |
| Value | Count | Frequency (%) |
| 784 | 2.1% | |
| 31200 | 110 | 0.3% |
| 29120 | 73 | 0.2% |
| 37440 | 55 | 0.1% |
| 41600 | 53 | 0.1% |
| 35360 | 52 | 0.1% |
| 43680 | 48 | 0.1% |
| 39560 | 46 | 0.1% |
| 59950 | 43 | 0.1% |
| 45760 | 42 | 0.1% |
| Other values (8561) | 36303 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 48794 | |
| 3 | 20126 | |
| 4 | 19313 | 10.3% |
| 5 | 15725 | 8.4% |
| 6 | 15451 | 8.3% |
| 2 | 14198 | 7.6% |
| 7 | 13871 | 7.4% |
| 8 | 12958 | 6.9% |
| 9 | 12927 | 6.9% |
| 1 | 12548 | 6.7% |
| Other values (2) | 784 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 186695 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 48794 | |
| 3 | 20126 | |
| 4 | 19313 | 10.3% |
| 5 | 15725 | 8.4% |
| 6 | 15451 | 8.3% |
| 2 | 14198 | 7.6% |
| 7 | 13871 | 7.4% |
| 8 | 12958 | 6.9% |
| 9 | 12927 | 6.9% |
| 1 | 12548 | 6.7% |
| Other values (2) | 784 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 186695 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 48794 | |
| 3 | 20126 | |
| 4 | 19313 | 10.3% |
| 5 | 15725 | 8.4% |
| 6 | 15451 | 8.3% |
| 2 | 14198 | 7.6% |
| 7 | 13871 | 7.4% |
| 8 | 12958 | 6.9% |
| 9 | 12927 | 6.9% |
| 1 | 12548 | 6.7% |
| Other values (2) | 784 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 186695 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 48794 | |
| 3 | 20126 | |
| 4 | 19313 | 10.3% |
| 5 | 15725 | 8.4% |
| 6 | 15451 | 8.3% |
| 2 | 14198 | 7.6% |
| 7 | 13871 | 7.4% |
| 8 | 12958 | 6.9% |
| 9 | 12927 | 6.9% |
| 1 | 12548 | 6.7% |
| Other values (2) | 784 | 0.4% |
a_median
Text
| Distinct | 10087 |
|---|---|
| Distinct (%) | 26.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.0015954 |
| Min length | 1 |
Unique
| Unique | 3184 ? |
|---|---|
| Unique (%) | 8.5% |
Sample
| 1st row | 43830 |
|---|---|
| 2nd row | 100640 |
| 3rd row | 164400 |
| 4th row | 106330 |
| 5th row | 26990 |
| Value | Count | Frequency (%) |
| 1078 | 2.9% | |
| 41600 | 64 | 0.2% |
| 59950 | 39 | 0.1% |
| 52000 | 39 | 0.1% |
| 49000 | 34 | 0.1% |
| 31200 | 33 | 0.1% |
| 37440 | 33 | 0.1% |
| 45760 | 29 | 0.1% |
| 47840 | 28 | 0.1% |
| 58390 | 27 | 0.1% |
| Other values (10076) | 36205 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 49370 | |
| 4 | 18511 | 9.8% |
| 3 | 16578 | 8.8% |
| 6 | 16429 | 8.7% |
| 5 | 16117 | 8.6% |
| 1 | 15317 | 8.1% |
| 7 | 14780 | 7.9% |
| 8 | 13496 | 7.2% |
| 9 | 13464 | 7.2% |
| 2 | 12965 | 6.9% |
| Other values (2) | 1078 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 188105 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49370 | |
| 4 | 18511 | 9.8% |
| 3 | 16578 | 8.8% |
| 6 | 16429 | 8.7% |
| 5 | 16117 | 8.6% |
| 1 | 15317 | 8.1% |
| 7 | 14780 | 7.9% |
| 8 | 13496 | 7.2% |
| 9 | 13464 | 7.2% |
| 2 | 12965 | 6.9% |
| Other values (2) | 1078 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 188105 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49370 | |
| 4 | 18511 | 9.8% |
| 3 | 16578 | 8.8% |
| 6 | 16429 | 8.7% |
| 5 | 16117 | 8.6% |
| 1 | 15317 | 8.1% |
| 7 | 14780 | 7.9% |
| 8 | 13496 | 7.2% |
| 9 | 13464 | 7.2% |
| 2 | 12965 | 6.9% |
| Other values (2) | 1078 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 188105 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49370 | |
| 4 | 18511 | 9.8% |
| 3 | 16578 | 8.8% |
| 6 | 16429 | 8.7% |
| 5 | 16117 | 8.6% |
| 1 | 15317 | 8.1% |
| 7 | 14780 | 7.9% |
| 8 | 13496 | 7.2% |
| 9 | 13464 | 7.2% |
| 2 | 12965 | 6.9% |
| Other values (2) | 1078 | 0.6% |
a_pct75
Text
| Distinct | 11787 |
|---|---|
| Distinct (%) | 31.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.0784653 |
| Min length | 1 |
Unique
| Unique | 4116 ? |
|---|---|
| Unique (%) | 10.9% |
Sample
| 1st row | 64110 |
|---|---|
| 2nd row | 142480 |
| 3rd row | 221910 |
| 4th row | 162780 |
| 5th row | 41760 |
| Value | Count | Frequency (%) |
| 1365 | 3.6% | |
| 75300 | 51 | 0.1% |
| 41600 | 40 | 0.1% |
| 72970 | 39 | 0.1% |
| 74050 | 35 | 0.1% |
| 62400 | 30 | 0.1% |
| 52000 | 28 | 0.1% |
| 95640 | 26 | 0.1% |
| 82220 | 26 | 0.1% |
| 191880 | 24 | 0.1% |
| Other values (11776) | 35945 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 49800 | |
| 1 | 19691 | 10.3% |
| 4 | 16817 | 8.8% |
| 6 | 16144 | 8.5% |
| 5 | 16044 | 8.4% |
| 7 | 15504 | 8.1% |
| 3 | 14541 | 7.6% |
| 8 | 14197 | 7.4% |
| 9 | 14000 | 7.3% |
| 2 | 12893 | 6.8% |
| Other values (2) | 1365 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 190996 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49800 | |
| 1 | 19691 | 10.3% |
| 4 | 16817 | 8.8% |
| 6 | 16144 | 8.5% |
| 5 | 16044 | 8.4% |
| 7 | 15504 | 8.1% |
| 3 | 14541 | 7.6% |
| 8 | 14197 | 7.4% |
| 9 | 14000 | 7.3% |
| 2 | 12893 | 6.8% |
| Other values (2) | 1365 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 190996 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49800 | |
| 1 | 19691 | 10.3% |
| 4 | 16817 | 8.8% |
| 6 | 16144 | 8.5% |
| 5 | 16044 | 8.4% |
| 7 | 15504 | 8.1% |
| 3 | 14541 | 7.6% |
| 8 | 14197 | 7.4% |
| 9 | 14000 | 7.3% |
| 2 | 12893 | 6.8% |
| Other values (2) | 1365 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 190996 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49800 | |
| 1 | 19691 | 10.3% |
| 4 | 16817 | 8.8% |
| 6 | 16144 | 8.5% |
| 5 | 16044 | 8.4% |
| 7 | 15504 | 8.1% |
| 3 | 14541 | 7.6% |
| 8 | 14197 | 7.4% |
| 9 | 14000 | 7.3% |
| 2 | 12893 | 6.8% |
| Other values (2) | 1365 | 0.7% |
a_pct90
Text
| Distinct | 13288 |
|---|---|
| Distinct (%) | 35.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.12614 |
| Min length | 1 |
Unique
| Unique | 5016 ? |
|---|---|
| Unique (%) | 13.3% |
Sample
| 1st row | 98810 |
|---|---|
| 2nd row | 203900 |
| 3rd row | # |
| 4th row | # |
| 5th row | 63900 |
| Value | Count | Frequency (%) |
| 1932 | 5.1% | |
| 74050 | 80 | 0.2% |
| 191880 | 48 | 0.1% |
| 113030 | 29 | 0.1% |
| 75380 | 26 | 0.1% |
| 79100 | 26 | 0.1% |
| 107240 | 25 | 0.1% |
| 94310 | 25 | 0.1% |
| 203990 | 23 | 0.1% |
| 63730 | 23 | 0.1% |
| Other values (13277) | 35372 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 49517 | |
| 1 | 23544 | |
| 6 | 16191 | 8.4% |
| 5 | 15479 | 8.0% |
| 7 | 15433 | 8.0% |
| 4 | 14884 | 7.7% |
| 9 | 14233 | 7.4% |
| 8 | 14231 | 7.4% |
| 2 | 13854 | 7.2% |
| 3 | 13491 | 7.0% |
| Other values (2) | 1932 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 192789 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49517 | |
| 1 | 23544 | |
| 6 | 16191 | 8.4% |
| 5 | 15479 | 8.0% |
| 7 | 15433 | 8.0% |
| 4 | 14884 | 7.7% |
| 9 | 14233 | 7.4% |
| 8 | 14231 | 7.4% |
| 2 | 13854 | 7.2% |
| 3 | 13491 | 7.0% |
| Other values (2) | 1932 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 192789 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49517 | |
| 1 | 23544 | |
| 6 | 16191 | 8.4% |
| 5 | 15479 | 8.0% |
| 7 | 15433 | 8.0% |
| 4 | 14884 | 7.7% |
| 9 | 14233 | 7.4% |
| 8 | 14231 | 7.4% |
| 2 | 13854 | 7.2% |
| 3 | 13491 | 7.0% |
| Other values (2) | 1932 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 192789 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 49517 | |
| 1 | 23544 | |
| 6 | 16191 | 8.4% |
| 5 | 15479 | 8.0% |
| 7 | 15433 | 8.0% |
| 4 | 14884 | 7.7% |
| 9 | 14233 | 7.4% |
| 8 | 14231 | 7.4% |
| 2 | 13854 | 7.2% |
| 3 | 13491 | 7.0% |
| Other values (2) | 1932 | 1.0% |
annual
Boolean
Constant Missing
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 34943 |
| Missing (%) | 92.9% |
| Memory size | 1.2 MiB |
| True | 2666 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| True | 2666 | 7.1% |
| (Missing) | 34943 |
hourly
Boolean
Constant Missing
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 37447 |
| Missing (%) | 99.6% |
| Memory size | 1.1 MiB |
| True | 162 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) |
| True | 162 | 0.4% |
| (Missing) | 37447 |
Interactions
Correlations
| area | area_type | o_group | |
|---|---|---|---|
| area | 1.000 | 1.000 | 0.021 |
| area_type | 1.000 | 1.000 | 0.032 |
| o_group | 0.021 | 0.032 | 1.000 |
Missing values
Sample
| area | area_title | area_type | prim_state | naics | naics_title | i_group | own_code | occ_code | occ_title | o_group | tot_emp | emp_prse | jobs_1000 | loc_quotient | pct_total | pct_rpt | h_mean | a_mean | mean_prse | h_pct10 | h_pct25 | h_median | h_pct75 | h_pct90 | a_pct10 | a_pct25 | a_median | a_pct75 | a_pct90 | annual | hourly | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | alabama | 2 | al | 0 | cross-industry | cross-industry | 1235 | 00-0000 | all occupations | total | 2091480 | 0 | 1000 | 1 | NaN | NaN | 26.61 | 55350 | 0.3 | 11.31 | 14.74 | 21.07 | 30.82 | 47.51 | 23520 | 30660 | 43830 | 64110 | 98810 | NaN | NaN |
| 1 | 1 | alabama | 2 | al | 0 | cross-industry | cross-industry | 1235 | 11-0000 | management occupations | major | 110240 | 0.9 | 52.708 | 0.74 | NaN | NaN | 57.05 | 118670 | 0.7 | 24.57 | 35.03 | 48.39 | 68.5 | 98.03 | 51100 | 72870 | 100640 | 142480 | 203900 | NaN | NaN |
| 2 | 1 | alabama | 2 | al | 0 | cross-industry | cross-industry | 1235 | 11-1011 | chief executives | detailed | 830 | 17.6 | 0.396 | 0.29 | NaN | NaN | 99.61 | 207190 | 4.6 | 50.46 | 62.96 | 79.04 | 106.69 | # | 104950 | 130950 | 164400 | 221910 | # | NaN | NaN |
| 3 | 1 | alabama | 2 | al | 0 | cross-industry | cross-industry | 1235 | 11-1021 | general and operations managers | detailed | 32370 | 1.6 | 15.476 | 0.67 | NaN | NaN | 64.8 | 134790 | 1.6 | 24.24 | 35.92 | 51.12 | 78.26 | # | 50410 | 74720 | 106330 | 162780 | # | NaN | NaN |
| 4 | 1 | alabama | 2 | al | 0 | cross-industry | cross-industry | 1235 | 11-1031 | legislators | detailed | 1120 | 8.7 | 0.533 | 3.1 | NaN | NaN | * | 36570 | 3.8 | * | * | * | * | * | 18270 | 20950 | 26990 | 41760 | 63900 | True | NaN |
| 5 | 1 | alabama | 2 | al | 0 | cross-industry | cross-industry | 1235 | 11-2011 | advertising and promotions managers | detailed | 50 | 35.6 | 0.022 | 0.16 | NaN | NaN | 62.59 | 130180 | 4.1 | 40.18 | 41.59 | 64.22 | 69.57 | 72.44 | 83570 | 86500 | 133570 | 144710 | 150670 | NaN | NaN |
| 6 | 1 | alabama | 2 | al | 0 | cross-industry | cross-industry | 1235 | 11-2021 | marketing managers | detailed | 1660 | 7.5 | 0.792 | 0.32 | NaN | NaN | 62.82 | 130660 | 3.1 | 31.42 | 40.36 | 54.31 | 78.64 | 105.29 | 65360 | 83940 | 112960 | 163580 | 219000 | NaN | NaN |
| 7 | 1 | alabama | 2 | al | 0 | cross-industry | cross-industry | 1235 | 11-2022 | sales managers | detailed | 4380 | 6.4 | 2.095 | 0.54 | NaN | NaN | 65.84 | 136950 | 3.8 | 31.78 | 40.29 | 52.28 | 78.55 | # | 66100 | 83790 | 108740 | 163390 | # | NaN | NaN |
| 8 | 1 | alabama | 2 | al | 0 | cross-industry | cross-industry | 1235 | 11-2032 | public relations managers | detailed | 410 | 19.7 | 0.198 | 0.4 | NaN | NaN | 54.46 | 113280 | 6.4 | 25.32 | 36.8 | 44.73 | 61.91 | 88.62 | 52670 | 76540 | 93040 | 128770 | 184330 | NaN | NaN |
| 9 | 1 | alabama | 2 | al | 0 | cross-industry | cross-industry | 1235 | 11-2033 | fundraising managers | detailed | 210 | 9.9 | 0.1 | 0.42 | NaN | NaN | 52.09 | 108350 | 6 | 26.61 | 28.97 | 41.15 | 53.2 | 93.36 | 55350 | 60260 | 85600 | 110650 | 194180 | NaN | NaN |
| area | area_title | area_type | prim_state | naics | naics_title | i_group | own_code | occ_code | occ_title | o_group | tot_emp | emp_prse | jobs_1000 | loc_quotient | pct_total | pct_rpt | h_mean | a_mean | mean_prse | h_pct10 | h_pct25 | h_median | h_pct75 | h_pct90 | a_pct10 | a_pct25 | a_median | a_pct75 | a_pct90 | annual | hourly | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 37599 | 78 | virgin islands | 3 | vi | 0 | cross-industry | cross-industry | 1235 | 53-3051 | bus drivers, school | detailed | 50 | 16.3 | 1.486 | 0.59 | NaN | NaN | 21.92 | 45590 | 8.1 | 16.91 | 18.23 | 18.92 | 25.02 | 31.64 | 35180 | 37920 | 39360 | 52050 | 65820 | NaN | NaN |
| 37600 | 78 | virgin islands | 3 | vi | 0 | cross-industry | cross-industry | 1235 | 53-3053 | shuttle drivers and chauffeurs | detailed | 60 | 2.5 | 1.815 | 1.22 | NaN | NaN | 16.5 | 34320 | 0.5 | 13 | 14.86 | 16.97 | 17.84 | 19.29 | 27030 | 30910 | 35300 | 37110 | 40120 | NaN | NaN |
| 37601 | 78 | virgin islands | 3 | vi | 0 | cross-industry | cross-industry | 1235 | 53-5011 | sailors and marine oilers | detailed | 110 | 28 | 3.163 | 15.55 | NaN | NaN | 14.41 | 29960 | 18.8 | 11.95 | 12.51 | 13.38 | 15.52 | 18.96 | 24850 | 26030 | 27830 | 32280 | 39430 | NaN | NaN |
| 37602 | 78 | virgin islands | 3 | vi | 0 | cross-industry | cross-industry | 1235 | 53-5021 | captains, mates, and pilots of water vessels | detailed | 120 | 12.3 | 3.653 | 15.92 | NaN | NaN | * | * | * | * | * | * | * | * | * | * | * | * | * | NaN | NaN |
| 37603 | 78 | virgin islands | 3 | vi | 0 | cross-industry | cross-industry | 1235 | 53-6031 | automotive and watercraft service attendants | detailed | 40 | 32.2 | 1.172 | 1.84 | NaN | NaN | 13.4 | 27870 | 10.9 | 11.2 | 11.5 | 11.56 | 12.29 | 19.81 | 23300 | 23920 | 24040 | 25560 | 41200 | NaN | NaN |
| 37604 | 78 | virgin islands | 3 | vi | 0 | cross-industry | cross-industry | 1235 | 53-7051 | industrial truck and tractor operators | detailed | 40 | 28.8 | 1.222 | 0.23 | NaN | NaN | 16.71 | 34760 | 15.8 | 14.77 | 15.36 | 15.36 | 17 | 18.91 | 30710 | 31950 | 31950 | 35360 | 39330 | NaN | NaN |
| 37605 | 78 | virgin islands | 3 | vi | 0 | cross-industry | cross-industry | 1235 | 53-7061 | cleaners of vehicles and equipment | detailed | 50 | 24.2 | 1.568 | 0.65 | NaN | NaN | 14.01 | 29140 | 4 | 10.5 | 12 | 13.12 | 14.63 | 18.81 | 21840 | 24960 | 27290 | 30430 | 39120 | NaN | NaN |
| 37606 | 78 | virgin islands | 3 | vi | 0 | cross-industry | cross-industry | 1235 | 53-7062 | laborers and freight, stock, and material movers, hand | detailed | 480 | 7.1 | 14.241 | 0.74 | NaN | NaN | 16.05 | 33390 | 1.8 | 13.45 | 14 | 15.81 | 17.66 | 18.71 | 27970 | 29120 | 32890 | 36740 | 38910 | NaN | NaN |
| 37607 | 78 | virgin islands | 3 | vi | 0 | cross-industry | cross-industry | 1235 | 53-7064 | packers and packagers, hand | detailed | 120 | 11.9 | 3.507 | 0.9 | NaN | NaN | 13.69 | 28470 | 3.7 | 10.76 | 10.94 | 11.98 | 14.93 | 17.99 | 22380 | 22750 | 24920 | 31060 | 37410 | NaN | NaN |
| 37608 | 78 | virgin islands | 3 | vi | 0 | cross-industry | cross-industry | 1235 | 53-7065 | stockers and order fillers | detailed | 440 | 6.6 | 12.816 | 0.71 | NaN | NaN | 13.73 | 28550 | 1.2 | 11.08 | 11.77 | 13.85 | 14.27 | 16.93 | 23040 | 24480 | 28800 | 29680 | 35210 | NaN | NaN |